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THREE-DIMENSIONAL SHAPE MEASURING METHOD AND ITS DEVICE 

Background of the Invention 

[0001] This invention is related to a 3D measurement system that uses laser-light 
and estimates depth information of an object by applying a triangulation method; among these 
systems, this invention is especially related to a 3D measurement system that consists of a 
single camera to acquire accurate depth information. This invention also relates to a 3D 
measurement method using the 3D measurement system. 

[0002] To acquire a 3D shape of the object, an active stereo method based on the 
triangulation method is well known (pattern projection method and point projection method). 
These systems consist of light sources, such as laser-lights or halogen lights, and image 
capturing devices, such as cameras or CCD (Charge-coupled Device) cameras. 3D shape 
estimation is conducted by a triangulation method of these devices. Therefore, a precise 
relationship between the light source and the camera should be precalibrated, and thus, the 
whole system is prone to become large and complicated. 

[0003] If we can make the calibration process of the light source and the camera 
easier, an active 3D acquisition system becomes simpler. Based on such an idea, a method 
was proposed that achieves easy calibration of the light source and the camera by attaching 
markers on the light source itself and capturing it with a single camera (Masahiro Takatsuka, 
Geoff A. W. West, Svetha Venkatesh,and Terry M. Caelli. "Low-cost interactive active 
monocular range finder". In Proceeding of Computer Vision and Pattern Recognition, volume 
1, pages 444-449, 1999.). 
Summary of the Invention 

[0004] Because the Takatsuka method assumes a beam of light for the light source, 
and thus, 3D information of only a single point can be acquired from a single image, it causes 
a serious problem because it takes a very long time to capture 3D data of a large area. Another 
problem of their method is that, because markers attached to the light source are aligned along 
a single line, the precision of acquired 3D information is inevitably low. 

[0005] Thus, our invention aims to reduce the capturing time and improve the 
precision of 3D data of a 3D measurement system based on an active stereo method in which 
a marker attached light source and a single camera are used. 
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[0006] This invention achieves dense 3D data acquisition from a single image by 
using a line laser as the light source. With this feature, a 3D shape can be efficiently acquired 
in a short time. In addition, because of the three-dimensional configuration of the markers, the 
light source position can be estimated accurately. Thus, the precision of 3D data can be 
improved. 

[0007] Because the proposed method can acquire dense 3D points from a single 
image, if the acquired 3D points include 3D points already estimated in a past procedure, the 
position of the light source can be corrected by using the 3D points that are already estimated. 
Because of this, the accuracy of the 3D shape can be improved. 
Brief Description of the Drawings 

[0008] Figure 1 shows a configuration of measuring devices for image data as a first 
embodiment of this invention. 

[0009] Figure 2 shows a configuration of the measuring devices for image data as a 
second embodiment of this invention. 

[0010] Figure 3 shows an example of how to arrange the light-emitting diode(LED) 
markers, which are used in this invention. 

[0011] Figure 4 shows the relationship between the marker coordinate system, 
which is fixed to the line-laser source, and the camera coordinate system. 

[0012] Figure 5 shows a method for estimating the rotation of the marker coordinate 

system. 

[0013] Figure 6 shows a method for estimating the positions of the 3D points used 
in this invention. 

[0014] Figure 7 shows a method for estimating the difference between the marker 
plane and the laser plane. 

Detailed Description of Preferred Embodiments 

[0015] In the following, an implementation of this invention is described, referring 
to the figures. First, we disclose the implementation of a 3D measurement process using 
figure 1. At a first step, the users set up all objects so image capturing device(l-2) can capture 
both the light projecting device(l-l) and the target object(l-4) in a same frame. At the same 
time, an iris value and shutter speed are adjusted so that the laser light (1-5) projected onto 
the object and LEDs (1-lb) are accurately and simply detected by cameras using an image 
processing method. In addition, the relationship between the light projecting device (1-1) and 
the sheet of light formed by the line laser are measured in advance. In terms of an image 
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capturing device, for example, we can use either a video camera, a CCD camera and so on. At 
a second step, the users hold the light projecting device(l-l) by hand and project the laser 
onto the measured object (1-4). Because the 3D configuration of the LEDs is known, by 
detecting the positions of the LEDs on the image plane, the relationship between the light 
projecting device (1-1) and the image capturing device (1-2) can be calculated. For 
calculation, a nonlinear optimization is conducted to improve accuracy. 

[0016] Because the relationship between the light projecting device (1-1) and the 
laser plane is known, we can recover a depth of light projected area from the image capturing 
device (1-2) with a triangulation method using the relationship between the light projecting 
device (1-1) and the image capturing device (1-2). 3D shapes of the target objects are 
obtained by repeating the process. If obtained 3D shapes are displayed on a display device (2- 
5) in real-time as shown in Fig. 2, the users can learn of areas that have not yet been measured 
on site. Thus, users can efficiently scan the 3D shape of the target object (2-4). 

[0017] Examples of a marker configuration are shown in Figure 3. If the number of 
marker is more than 4, a marker configuration is basically free; however, a square-shaped 
configuration (3-1), an xyz coordinate-shaped configuration (3-2), and a cube-shaped 
configuration (3-3) have some advantage, such as easy manufacture and simple calculation 
for retrieving the relationship between the light projecting device (1-1) and image capturing 
device (1-2). 

[0018] In the following, a specfic implementation of a 3D measuring device and the 
method are described. 

[0019] Our system consists of a video camera and a laser-projecting device (see 
Fig.l). While measuring, the users hold the laser projector by hand and project the laser onto 
the measured object. The projected stripe is observed by the video camera. The shape of the 
target object is obtained by analyzing the video sequence. To estimate the position and 
orientation of the line laser-projecting device, LED markers are attached to the laser projector. 
LED markers are located so that they form a square and so that the square is placed on the 
same plane as the sheet of light. 

[0020] These markers define the local coordinate system in 3D space (4-1). We call 
this the "marker coordinate system" in this description. The origin of the marker coordinate 
system is placed at the center of the markers and we define the marker coordinate system by 
three unit vectors, el ? , e2 f and e3\ The first and the second vectors, whose directions are 
expressed by el 1 and e2', are taken so that they are parallel to the edges of the square formed 
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by markers, (el 1 is parallel to the light projecting direction of the laser projector and e2' is 
perpendicular to that direction.) The direction of the third vector e3 ? is orthogonal to the plane 
formed by the four markers. We also define the camera coordinate system by el, e2 and e3, 
where el and e2 are respectively parallel to horizontal and vertical directions of the image 
plane, and e3 directed towards the backward direction of the camera(4-2). The center of the 
camera coordinate system is defined as the center of the lens. See Fig. 4. 

[0021] Here, we name two planes. The first one is the plane spanned by el 1 and e2', 
which we call the "marker plane"(5-4). The second one is the plane where the laser light is 
projected, which we call the "laser plane." The transformation between the "marker plane" 
and the "laser plane" is represented by a 3D rotation matrix R and translation vector t. This 
transformation represents the marker positions from the image capturing device. See Fig. 5. 
When the shape of the target object is scanned, the emitted light from the markers is captured 
by the video camera. From the captured image frame, the locations of the four LEDs are 
obtained. In actual implementation, it can be achieved by simple thresholding and calculating 
the center of gravity of the connected pixels. By using the locations of the LED markers, a 
description of the lines on the image that go through the markers is calculated and the 
crossing points of the lines are also calculated. The crossing points are called Vanishing 
Points (VP).(5-2a) and (5-2b). If the lines are parallel on the image, the VP is defined as the 
point at infinity at the direction of the parallel lines. Let the 2D coordinates of these vanishing 
points be described as (g Qn 9 g ln \n = 0,l) in pixels, then, the 3D direction vectors(rl and r2) of 
those VPs are 

r m ={gon>gln-f)' C 1 ) 

where f denotes focal length in pixel and °' denotes transpose of vectors and matrices. We 
define e\ = ^/||^i||»0" = 1>2). The unit vectors e\ are used as an estimation of e\ . Estimation e\ 
is defined by taking a cross product of the two unit vectors as e r 3 = e[ x e 2 . Figure 5 shows 
relationships between VP and estimations e' n (i = 1,2). Using three unit vectors, estimated R 
of rotation matrix R can be described as: 

^ = (e 1 ,e 2 ,e 3 ). (2) 
Let 2D coordinates of images of first and second markers be (p n , p l2 ), (p 21 , p 22 ) . 3D 
locations in camera coordinates of the markers are 

m \ = (PxiV,Pi 2 u 9 -/uy 9 m 2 = (p 2l v 9 PnV-Jvy (3) 



5 

where u, v are unknown variables. Let the 3D distance between those markers, which is 
known value, be Dm. Then 

H-m 2 \\ = D m . (4) 
The equation of the marker plane is expressed as e[\x + d = 0 , where x is a 3D point 
expressed in camera coordinates. Because m l9 m 2 are on the marker plane, 

e>,+rf = 0(/ = l,2). (5) 

[0022] From equations (3)(4) and (5), d,u,v can be solved. Then we obtain i , as an 
estimation of / , by calculating the center of gravity of locations of the markers. 

[0023] To improve the estimation of the parameters of transformation, we optimize 
it with a nonlinear optimization. The evaluation function for optimization is defined as the 
sum of the squared distances between the actual LED positions on the image and the 
estimated marker positions on the image, which are calculated by projecting the LED position 
in local coordinates to the image plane by using estimated R and i . The function is 
described as 

/(£f)=2W(*»: +0--P* } ( 6 > 

proj((x x ,x 29 x 3 y)= (O, )/* 3 , (fx 2 )/x 3 y (7) 

where p. is a location of the image of the i th marker, and m\ is a location of the / th 

detected marker in marker coordinates. projO is an operation of projection from 3D position 
in camera coordinates into image coordinates. For actual implementation, we can use a 
simplex descending algorithm for the optimization. 

[0024] Then, using the estimated parameters of the marker plane, we can estimate 
the parameter of the laser plane. A detailed method will be described later. 
First, the pixels that are projected by laser light (6-1) are extracted from the image (6-5). The 
pixels are easily picked up by a simple thresholding method, however, for actual 
implementation, skeletonizing and/or morphological filters are applied to improve accuracy. 
The line going through the origin of the camera coordinates (6-4) and the surface point (line 
of sight) can be calculated for each pixel using intrinsic parameters, and the 3D location of 
the surface (6-3) can be determined by taking the intersection of the line of sight and the 
estimated laser plane(6-2). Fig.6 shows how to estimate surface locations by triangulation. 

[0025] Here, we explain the method to acquire a correct relationship between the 
marker coordinates and the laser plane. The relationship can be defined as the parameters of 
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the laser plane expressed in marker coordinates. The parameters can be expressed by (a, b, c, 
d) which represents a plane in 3D space. If the markers plane precisely corresponds to the 
laser plane, the equation of the laser plane is z=0. 

[0026] For preparation of the calibration between the marker coordinates and the 
laser plane, the following measurement must be conducted in advance. First, a box-shaped 
object (7-5), of a known size, is captured. From the image, the extrinsic camera parameters 
are estimated using methods of camera calibration. From the extrinsic camera parameters and 
the size, the equations of the planes enveloping the box are estimated. 

[0027] We project the laser light onto the surface of the target object, and capture 
the lit surface and the markers. From the markers, the transformation between the marker 
coordinates and camera coordinates can be estimated by the previously described method. 
Then, the laser plane expressed in the camera coordinates can be calculated from the 
estimated laser plane parameters (a, b, c, d) expressed in marker coordinates. Next, the 
intersecting lines of the laser plane and the surface of the box shaped object are calculated. 
Then, the images of the lines can be determined (7-2). 

[0028] We calculate the intersection of the laser plane estimated by the above 
method and the box-shaped object. Then, we estimate the image of the intersection and 
compare the image with the true positions of the image of the surface points lit by the laser 
(7-1). The squared distances between the pixels of the true image and the estimated image are 
summed up for all the pixels. The sum is the estimation function of the parameters (a, b, c, d), 
which depicts the laser plane expressed in the marker coordinates. By minimizing the 
estimation function using a non-linear optimization method, we can estimate (a, b, c, d). The 
initial value for the optimization can be, for example, z=0, which is equivalent with 
(a,b,c,d)=(0,0,l,0). See figure 7. 

[0029] Finally, we describe a method to correct the estimation of the marker 
coordinate system from known 3D points. When the 3D locations of some part of the 
measured surface are known and if some part of the projected stripe of the laser is on the 
region of surface whose depth values are known, the known depth value and the depth value 
calculated from the estimated laser plane should match. Suppose we define an estimation 
function that is the sum of function (6) and the sum of squared differences between the 
known depth value and the calculated depth value expressed above. The summation of the 
squared differences is calculated for all the region of the surface whose depth values are 
known. Then, we can correct the estimation of the marker coordinate system by minimizing 
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the estimation function. As for the known 3D points, we can use the 3D points precisely 
measured by other methods. Another example for the known 3D points is that we can select 
points that are measured many times and the variances of whose positions are small. 

[0030] As explained above, applying this invention, we can quickly acquire a 
precise 3D shape without using complex equipment, which is conventionally required for 
active 3D measurement by triangulation. Moreover, the users can interactively check the 
regions whose positions are not measured during the measuring process by displaying the 
acquired 3D positions in real-time; thus, measurement of 3D shape of the target object 
becomes efficient. 



